Toward a Noncommutative Arithmetic-geometric Mean Inequality: Conjectures, Case-studies, and Consequences
نویسندگان
چکیده
Randomized algorithms that base iteration-level decisions on samples from some pool are ubiquitous in machine learning and optimization. Examples include stochastic gradient descent and randomized coordinate descent. This paper makes progress at theoretically evaluating the difference in performance between sampling withand without-replacement in such algorithms. Focusing on least means squares optimization, we formulate a noncommutative arithmetic-geometric mean inequality that would prove that the expected convergence rate of without-replacement sampling is faster than that of with-replacement sampling. We demonstrate that this inequality holds for many classes of random matrices and for some pathological examples as well. We provide a deterministic worst-case bound on the gap between the discrepancy between the two sampling models, and explore some of the impediments to proving this inequality in full generality. We detail the consequences of this inequality for stochastic gradient descent and the randomized Kaczmarz algorithm for solving linear systems.
منابع مشابه
Commentary on "Toward a Noncommutative Arithmetic-geometric Mean Inequality: Conjectures, Case-studies, and Consequences"
In their paper, Recht and Ré have presented conjectures and consequences of noncommutative variants of the arithmetic mean-geometric mean (AM-GM) inequality for positive definite matrices. Let A1, . . . , An be a collection of positive semidefinite matrices and i1, . . . , ik be random indices in {1, . . . , n}. To avoid symmetrization issues that arise since matrix products are non-commutative...
متن کاملBeneath the valley of the noncommutative arithmetic-geometric mean inequality: conjectures, case-studies, and consequences
Randomized algorithms that base iteration-level decisions on samples from some pool are ubiquitous in machine learning and optimization. Examples include stochastic gradient descent and randomized coordinate descent. This paper makes progress at theoretically evaluating the difference in performance between sampling withand without-replacement in such algorithms. Focusing on least means squares...
متن کاملBest Upper Bounds Based on the Arithmetic-geometric Mean Inequality
In this paper we obtain a best upper bound for the ratio of the extreme values of positive numbers in terms of the arithmetic-geometric means ratio. This has immediate consequences for condition numbers of matrices and the standard deviation of equiprobable events. It also allows for a refinement of Schwarz’s vector inequality.
متن کاملA Relationship between Subpermanents and the Arithmetic-Geometric Mean Inequality
Using the arithmetic-geometric mean inequality, we give bounds for k-subpermanents of nonnegative n × n matrices F. In the case k = n, we exhibit an n 2-set S whose arithmetic and geometric means constitute upper and lower bounds for per(F)/n!. We offer sharpened versions of these bounds when F has zero-valued entries.
متن کاملOn Inequalities for 2F1 and Related Means
This informal working paper provides an expository survey of some inequalities and associated conjectures involving the Gaussian hypergeometric function 2F1 and closely related bivariate means. Recent as well as previously established results are presented for which the conjectures are known to hold. This basic investigation begins with the fundamental concept of a bivariate mean which is defin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012